Lensing of the CMB: Non Gaussian aspects 
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We study the generation of CMB anisotropies by gravitational lensing on small angular scales. We 
show these fluctuations are not Gaussian. We prove that the power spectrum of the tail of the CMB 
anisotropies on small angular scales directly gives the power spectrum of the deflection angle. We 
show that the generated power on small scales is correlated with the large scale gradient. The cross 
correlation between large scale gradient and small scale power can be used to test the hypothesis 
that the extra power is indeed generated by lensing. We compute the three and four point function 
of the temperature in the small angle limit. We relate the non-Gaussian aspects presented in this 
paper as well as those in our previous studies of the lensing effects on large scales to the three and 

I*^ I four point functions. We interpret the statistics proposed in terms of different configurations of the 

Q^ . four point function and show how they relate to the statistic that maximizes the S/N. 
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I. INTRODUCTION 



The anisotropies in the Cosmic Microwave Background (CMB) are thought to contain detailed information about 
the underlying cosmological model. In conventional models the anisotropies on most angular scales were created at 
^Zq • the last scattering surface, at a redshift of z ^ 1000. At these early times the evolution of perturbations can be 
^\ ] calculated accurately with linear theory. The calculation of theoretical predictions is almost straightforward, thus 
TJ" ■ detailed observations of the microwave sky can, at least in principle, greatly constrain the cosmological model. We 
^^ I expect to be able to measure many of the parameters of the cosmological model with percent accuracy 0| . 

There are several physical processes that imprint anisotropies on the CMB after decoupling. Some of them will 
^y. ' degrade our ability to learn about cosmology, like foreground emission from our galaxy. Others will allow us to 
constrain processes that happen after decoupling and help us understand the evolution of our universe. For example 
the reionization of hydrogen by the ultraviolet light from the first generation of objects leaves a distinct mark in the 
polarization of the CMB [gj, Sunyaev-Zeldovich emission from hot gas along the line of sight creates temperature 
O anisotropies and the large scale structure (LSS) of the universe deflects the CMB photons, lensing the anisotropies 

S ill- 

jrt . When studying the lensing effect produced by the large scale structure of the universe we are trying to detect lensing 

!• ' produced by random mass fluctuations, the LSS, on a random background image, the CMB. The characteristics of 

. !^ ] the lensing effect depends on the relative size of the coherence lengths of these two random fields. In MM we studied 

S^ ■ the limit of a rapidly fluctuating CMB background being lensed by a slowly varying mass distribution. This is the 

Jh [ appropriate limit to recover the power spectrum of the projected mass density on scales much larger than the coherence 

- - ' length of the CMB, S, ~ 0.15°. In this paper we study the opposite limit, the generation of power on scales much 

smaller than ^. 

The lensing effect is expected to be the dominant non primordial contribution to the CMB anisotropies on small 

scales {I ~ 3000). It has been shown 0] that an accurate determination of the power generated by lensing can help 

break some of the parameter degeneracies in the CMB. Interferomentric observations of the anisotropies such as those 

that will be carried out by CBI [g| are designed to make measurements at these angular scales. 

To be able to use the observed power on small scales to break the degeneracies in the parameters one must be sure 

that one is observing the lensing signal. We will show that the small scale power generated by gravitational lensing 

has a very definite signature, it is correlated with the large scale gradient. Regions of the sky where the large scale 

gradient is larger will have more small scale power. The physical effect can be understood easily in the case of a 

cluster of galaxies lensing a smooth CMB gradient. 

We will also compute the general three and four point function of the temperature field induced by lensing and 

show that both the statistic discussed in this paper and those proposed in Q are particular subsets of the possible 

configurations of the four point function. We will construct the statistic that maximizes the signal to noise ratio. 



II. GENERATION OF POWER ON SMALL SCALES 

The measured temperature field T{9) can be expressed in terms of the unlensed CMB field at the last scattering 
surface T{6) and the deflection angle of the CMB photons 56, 

T{e) ^f{e + S0) 

^m + se.wm + iso.soA^no). (1) 

In this paper we are interested in the effect of modes of the deflection angle of spatial wavelength much smaller than 
that of the unlensed CMB. 

A. A toy example 

To understand the physics we will first consider the lensing induced by a cluster of galaxies. We will discuss a very 
simplified example in this paper, the reader interested in the detectability of the effect for real cluster should look 
at reference [Q. In most cases a cluster will subtend a few arcminutes on the sky. Over such a scale the primary 
anisotropics are expected to have negligible power so we will treat the unlensed temperature field as a pure gradient. 
Without loss of generality we will take the gradient to be along the y axis with an amplitude Tyo- The observed 
temperature becomes, 

T{e)^fyo{0y+S0y). (2) 

For a spherically symmetric cluster at the origin the deflection is of the form, 

se = -56^. (3) 

For a singular isothermal sphere SO — ATr{ay/c)'^DLs/ DqSi with a^ the cluster velocity dispersion, D^s the distance 
from the lens to the source and Dqs that between the observer and the source. In an Einstein DeSitter universe 
DlsI Dos ~ 1/Vl + ^L, with zl the redshift of the cluster and where we have assumed that zl is much smaller than 
the redshift of recombination. A singular isothermal profile will be a good approximation for a cluster up to some 
maximum radius after which the deflection will fall as \/9. In the central part of the cluster there might be a core. 
The typical value for 50 is a fraction of an arcminute. 

The effect of lensing can be understood by looking at figure ||. We focus on the temperature as a function of Oy 
for a fixed O^- In the absence of lensing we would observe the gradient. On the other hand the cluster will defiect 
the light rays so that for Oy > Q the rays are coming from a lower value of Oy in the last scattering surface. If the 
gradient is positive this implies that for Oy > Q in the presence of the cluster we would observe a lower temperature 
than what would be observed if the cluster was not there. The opposite is true for Oy < 0. Far away from the cluster 
the lensed temperature should coincide again with the gradient. Thus the cluster creates a wiggle on top of the large 
scale gradient. This effect is shown in figure 0. The size of this wiggle is Ty^SO, thus it is proportional to the size of 
the gradient and the defiection angle. For a cluster we can use the determined deflection to infer the mass and some 
information about the cluster profile. 

Note that the proposed signal is directly sensitive to the deflection angle and not the shear. This is the only method 
that has this property. It is usually argued that we can never measure the deflection angle in a lensing system because 
we do not know the original position of the background image. In this case we avoid this argument because we do 
know what the background image is, it is a gradient which we can measure on larger scales than the cluster. The 
other important point is that the effect is proportional to the gradient TyQ, this is the signature we will use in what 
follows to identify the lensing effect by large scale structure in the limit we are studying. 

B. Large scale structure 

For simplicity we will work in the small angle approximation so that we can expand the temperature field in Fourier 
modes instead of spherical harmonics. The Fourier components are defined as, 
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FIG. 1. In the upper panel we show a cluster lensing a background gradient. The bottom panel shows the temperature 
measured for a fixed O^ as a function of 9y in the presence and absence of the cluster. Points with 8y > get deflected to a 



the unlensed one. The opposite is true if 6'j, < 0. 
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FIG. 2. Temperature profile of a CMB gradient lensed by a cluster. We took Tyo59 = 13^K. Tlie gradient part lias been 
subtracted out for clarity. The cluster profile was cut-ofi' at 4 arcmin. 



T{1) = f (f9e'^-^T{e) 
56(1)= f d^9e'^-^S9{9) 



(4) 
We assume both the CMB and the deflection angle are Gaussian random fields characterized by their power spectra, 

{f{h)f ih)) - (2^)2<5^(Zi2)Cr 
{S9{h)- 69(12)) = {27:)'S''{h2)Ct', (5) 

where I12 = li + 12, Cf^ is the power spectrum of the primary CMB anisotropics and Cf^ is the power spectrum of 
the deflection angle. 

To leading order in the deflection angle the Fourier components of the lensed CMB field are, 

T{i) = f{i) + j ^ mn ■ ^ni - n- (6) 

We want to calculate the power spectrum of the lensed CMB field (Cf^) on very small scales, scales much smaller 
than the coherence length of the unlensed CMB gradient. We start by considering a small patch of the sky of size L, 
with L small enough that the gradient of the unlensed field can be considered constant so that the unlesed CMB map 
can be approximated linearly. On these scales, the small scale variations of the deflection angle generate additional 
power in the lensed CMB field. We define fx{9) = f^ and fy{9) = fyo so that, 

T(0)=f,o [0^ + S9^{9)]+fyo [9y + S0y{9)]. (7) 

For the Fourier modes at large I we get, 

Tii)^f,oSe^ii) + fyo6eyii). (8) 

Thus the power spectra of the lensed temperature is given by, 

{T{h)T{l2)) = {2^)H^{h2)osCf'j2, (9) 

with as = (T^o + Tyo)- For SCDM as « 2 x l{f ^iKrad^^ « IS/iiiTarcmin^^ Note that to obtain equation (||) we have 
replaced T^p ^"^^ ^»o by their averages. This implies that we are measuring the power spectrum over a large enough 
area of the sky that there are many patches over which the gradient is averaged. Equation (^ is a very interesting 
result on its own, it shows that the power in the tail of the CMB anisotropies directly gives the power spectrum of the 
deflection angle. This result can also be derived by taking the appropriate limit in the full expression for the lensed 
CMB spectra as derived for example in |y. 

In figure we show the various power spectra so as to gain some insight on the orders of magnitude. In panel a 
we show the lensed and unlesed CMB spectra together with asC^^ /2, the limiting value in the damping tail. Several 
conditions must be met for equation (S) to be valid. A very important fact is that the power spectra of both the 
deflection angle and the CMB derivative fall with I. This means that when we look at the power generated by lensing 
in a high I mode of the temperature, it could be dominated by a nearby I of the temperature derivative multiplied by a 
low I deflection angle mode rather than by the power coming from a small I temperature derivative mode multiplied by 
a high I deflection mode. Equation (||) is only valid in the latter case. We attain this limit because the power spectra 
of the deflection angle falls more slowly with I than that of the CMB derivative. The power in the deflection angle falls 
like a power law (figure Be) while than in the CMB derivative drops exponentially (figure rab). As a consequence at a 
high enough I it is always easier to produce power by multiplying low frequency temperature modes by high frequency 
deflection modes. At I > 4000 this effect dominates, at lower I both effect compete. Another condition is that we 
should apply equation (j^ for I values large enough that most of the power in the unlensed derivative (contributing 
to as) comes from a lower I. Figure ^d shows that for SCDM most of the power in the derivative comes from scales 
I < 2000. 
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FIG. 3. The panel (a) shows the lensed and unlensed temperature power spectra together with asCi /2. Panel (b) the 
power spectra of the derivative of the unlensed CMB field, (c) the spectra of SO and (d) the cumulative power in the CMB 
derivative. 



III. CORRELATION BETWEEN SMALL AND LARGE SCALES 

In the previous section wc have shown that the power in the CMB anisotropics at large enough I measures the 
power spectrum of the deflection angle SO. If we want to use the small scale power to measure the fluctuations in 
the deflection angle we need a way to determine that the power we observe is indeed produced by lensing and is not 
at the last scattering surface (ie. it is due to a different underlying primary anisotropics spectrum) or that it is not 
produced by another secondary effect. To be able to test this we need to consider higher order statistics, we need to 
go beyond the power spectrum. 

The crucial point we are going to use is that the power produced by lensing comes from large scale primary 
temperature modes promoted to higher / by the small scale deflection of the photons. We will attempt to test the 
hypothesis that the power we observe is generated by lensing by looking at the correlation between the large scale 
temperature derivative and the small scale power. If we continue with the simplification that the gradient of the 
unlensed field is constant, when we high pass filter the temperature map we obtain, 

^ f, 66^(9) + fyS9y{e), (10) 

where Ihi and lh2 define a ring in I space and we have abused notation and called 59i{6) the high passed filtered 
deflection angle. We will study the behavior of the square of Th properly smoothed, 

n{e)= I d?-e'WH{e-0')T]j{e'). (ii) 

If we smooth the map over a scale bigger than that of the variations in the deflection angle but over which T^ and 
Ty remain constant, we can replace 5&1. and 59^ by their averages. It follows that the smoothed high passed filtered 
map is approximately, 

n{e)^^^{n + f^){e). (12) 



Only the modes with wavevectors Ihi < I < ih2 contribute to the average {S9^) in equation (|l2[). 

We now look at the anisotropics on larger scales by applying a low pass filter to the temperature field. 



(27r)2 



TUB) = I TT^e-^'-^ni). (13) 



We compute the square of the gradient. 



£(0) = ||VTi|p 

«(f2+f2)(0). (14) 

Equations (|2|) and (|lj) show the maps of C and Ti. trace each other, 

me) « ^£(0). (15) 

We have a very simple test to establish that the small scale power is generated by lensing, the small scale power has 
to be correlated with the large scale derivative. Instead if the small scale power is Gaussian power already present at 
the last scattering surface, different Fourier modes are uncorrelated. In the Gaussian case if we construct C and Ti. 
with different modes (Ihi > h) there will be no correlation between the two maps. In practice instead of smoothing 
the high passed temperature map we work in Fourier space and correlate the low I modes of T^ (6) with those of the 

Equations dl3) and (H) were obtained under simplifying assumptions, more general expressions can also be derived 
but we only show this limiting case to make the physics more transparent. Comparison with simulations show that 
this expressions are good approximations in the regime of interest. The procedure to simulate a lensed CMB map 
was discussed in detail in reference |q]. We generate realizations of CMB and projected mass density k and use a ray 
tracing technic to produce a lensed CMB map. 



„ 500 

^ 

^-500 

-1000 



1 1 1 1 1 r 




^[degrees] 



10 '- 



^ 5 

E- -5 
-10 




^[degrees] 




^[degrees] 



FIG. 4. Example of the generation of power by gravitational lensing in one dimension. The upper panel shows the unlensed 
temperature and the result of filtering the lensed T. The middle panel shows the high I power generated by lensing. In the 
bottom we show the square of the small scale power and the square of the large scale gradient (arbitrarily scaled). 



We will explain the effect we are studying by first presenting an toy example in 1 dimension. We will consider a 
2 dimensional universe so that the last scattering surface becomes a line. In the top panel of figure Q we show the 
unlensed CMB field together with the low passed filtered {h — 2000) lensed anisotropies. It is clear that both curves 
trace each other, although they are displaced. This is the consequence of the large scale modes of the deflection angle, 
but remember we are after the power generated by the small scale modes of 66. In the middle panel we show the high 
passed {I > 6000) lensed CMB, the fluctuations in this panel are the result of lensing. The power is not distributed 
uniformly, the fluctuations are larger where the derivative of the CMB in the top panel is larger, as equation (n3) 
indicates. This is the same physical effect we discussed in the cluster example. On the contrary in regions where the 
CMB is constant, surface brightness conservation implies that lensing does not create any power. The bottom panel 
shows the square of the small scale temperature anisotropies and a scaled version of the low I gradient to ease the 
comparison. 

We can now look at the results for a 2 dimensional last scattering surface shown in figure ^. The low passed gradient 
map was constructed using scales I < 2000 and the high passed temperature was constructed with modes / > 6000. 




FIG. 5. The upper panel on the left shows the lensed temperature field. The upper right panel shows the high passed 
temperature. Bottom left has gradient square and bottom right smoothed square of high passed temperature. 

The upper left panel shows the lensed CMB map while the upper right panel has the high passed filtered field. It is 
clear that the small scale power in not Gaussian and is higher where the large scale derivative is higher. To make this 
even more apparent the two bottom panels show the L, and 7i maps, the correlation is excellent. 

The £ and Ji maps only differ by a factor {59'^) / 2 (equation HSh, so their cross correlation defined as {Ti{li)C{l2)) = 
{2tt)'^ 5^ {li2)C]^'-' , is simply related to the power spectrum of the C map (Cf^), 
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In figure ^ we show the ratio C]^'~ /Cf'' obtained in simulations. We have used / < 2000 to construct C, and several 
different rings at higher I for Ti. The order of magnitude of the ratio is consistent with equation (^^, it remains 
constant over a wide range of I but starts to fall once we approach / ~ 2000. The first relation in equation ([iq ) is really 
C]^'' = Wi{S9'^)Cf^ /2 where Wi is some window function. The origin of the window can be understood as follows: 
lets call Ifi a high I mode of the deflection and h and I2 two modes of the derivative. When we compute the gradient 



square this modes combine to give power at Z3 = Zi + Z2. In the high passed case, we recover this component of the 
field by looking at the modes with Ih + li and —Ih + I2 so that when we multiply them they give the I3 variations. 
As I3 becomes larger at least one of li or I2 also become larger and because we are selecting only a ring in I space 
for the high passed map there are higher chances either Ih + h or — Z^ + I2 will fall outside the ring and we cannot 
reconstruct I3. So as Z3 gets larger the correlation between the two maps fall. This effect explains the window Wi 
is less important the wider rings for the high passed maps is (compare the high / behavior of the 4000 — 12000 and 
4000 - 8000 rings in figure |a). 

In figure Ob we show the correlation coefficient Ci = C]^^ / \/ {Cf^'^Cf^) . Note that the cross correlation for low 
I is very high, almost one. This proves our claim that on large scales the £ and Ti. maps trace each other almost 
perfectly. There are several reasons why the two maps do not correlate exactly. First, although most of the power 
in the derivative of the CMB comes from I < 2000 there is still about 5% additional power coming from higher I. 
This degrades the cross-correlation because some of the power on the high passed map is coming from these modes 
of the derivative field. A more extreme case is shown in the figure, where only modes with I < 1000 were used to 
construct the C map, the cross-correlation in this case is significantly smaller. The other important effect is that some 
of the power in the high / map is due to primary anisotropics, these modes are uncorrelated with the low I derivative 
and thus reduce the cross correlation coefficient. This effect is more important for the 4000 — 8000 ring than for the 
8000 — 12000 so the cross correlation is smaller for the former. The 4000 — 12000 rings falls somewhere in the middle 
of this two cases. 

We will calculate Cf'-' , which can be done analytically. If we assume that only a Gaussian component is contributing 
to the low pass filtered field then it is possible to calculate the power spectra of the gradient squared, £ = HVTlIP. 
The correlation function in real space for two points separated an angle 6 in the x direction gives, 

(£(O)£(0)) = {CY + '^iClM + Cl^{e)). (17) 

We have introduced C,^,AO) = {TxiO)TA9)) and Cyy{9) = {Ty{0)Ty{9)) . They are given by, 

C(0) = {fAO)f,{e))cMB 

l-l, 



^{27rr' f'd^ I e''-'-°'^^fcr cos' < 

^iy-^i'cr[Jom-j2{io)] 

Cyy{e) ^ {fy{O)fy{0))cMB 

47r 
^\[Coie) + C2{9)] 

a,y{e) = {fAo)fy{e))cMB 

rk 
-2 / j27 il-0cos4>i i2y-iTT 



= (27r)-^ / dH e"-*^™^^' I'Cl' cos 4>isuv4>i 

= 0, (18) 

where Cq{6) and C2{0) are defined as the integrals over l^Cf^dl/2TT weighted with Jo{W) and J2{W), respectively. 

The constant part in equation ( [l7| ) only contributes to the I = mode. The second term is agN'^'^{d) in the 
notation of g] . The power spectra of the low pass filtered map is that of 5 in |6| , but with only the low I modes 
included. For / 7^ 0, 

Cf^ = 27r / ede{C^ + Ci)JoiW). (19) 

This calculation coincides perfectly with the results of simulations. In the presence of detector noise the above 
expression generalizes to Cf^ -^ Cf'-' + Nf^, where Nf'-' is the detector noise contribution calculated using equations 
(O) and (jl^) with the power spectrum of the detector noise. 
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FIG. 6. Panel (a) shows the ratio Cj^ /Cl"^ for several different rings used to construct Ti.. Panel (b) shows the cross 
correlation coefficient in I space. 



11 



Let us now consider C'j^'^ . If the contribution to the power where Gaussian then the correlation function of Ti in 
real space would be, 

{n{o)n{e)) = {n^f + 2C\0) (20) 

where, 

C{9) = {fH{Q)fH{e))cMB 



'--^ Idl 



-crMie)- (21) 

The power spectrum becomes, 

Q«« = 47r /" 9deC^{9)Jo{ie) (22) 

This expression only coincides with the results of simulations when either detector noise or intrinsic CMB anisotropies 
dominate the power in the I range used to construct H, that is if the unlensed power is Gaussian. In the idealized 
detector noise free examples we have discussed above this corresponds to lh2 < 4000. Fortunately for the power 
generated by lensing, the power spectra of the Ti. map is a scaled version of the power spectrum of C, thus it can be 
obtained using equations dlq) and ( J2l| ) . Again in the presence of noise Cj^^ -^ C'j^^ + A^/^'^ with A"/^ '^ calculated 
using equation (E2|) with the detector noise power spectra. 

To assess the signal to noise of our lensing signal we need to calculate the variance in the estimator of C]^'', 

^■"'-WpT-I^'^ma.-,). (23) 

where the integral is done over a small area in I space of size Ai centered around / and Af = (27r)^/f7 is the area of 
the fundamental cell in I space. We have denoted fl the area of sky observed. 
If we assume that C and Ti. are Gaussian fields, we can calculate the variance as, 

CoviiCrf] = ^[{Cn' + Cl'^'Cf^]. (24) 

The ratio A/Af is the number of available modes one can use to measure the cross correlation. It can be approximated 
by Ai/Af — fsky{2l + 1) for a unit width ring. In figure [7| we show the variance in the cross correlation measured in 
simulations normalized to the Gaussian prediction in equation (|24|). The agreement is very good implying that we 
can use the Gaussian formula to compute the variance. 

To compute the total S/N we want to combine the signal in all the I modes of the cross correlation that we can 
measure. To do this we will compute X — J^i (^iCV'^ choosing ai to maximize S/N= [(X)^/Cov(X^)]^/^. It is straight 
forward to show that a, = C;'"^/Cov[(C;^^)2], Q. We then get. 
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^, icrr "^ 



.Kl, 



Cov[(C«^)2] 



Uy g(2Z + 1) ((1 + iV,"/Cf ^)(1 + Nr^/Cr^) + Cf) 



1/2 

(25) 



We have implicitly assumed rings of unit width. 

We want to relate equation (|2^) to the S/N with which we can measure the small scale power generated by lensing. 
It is straightforward to show using equations (|l|), (H) and the fact that C; ~ 1 that N^'^/Cf^'^ « (w^VC*^^)^- here 
C"^"^ is the averaged power in the band l^i < I < lh2 and w^ is the power spectrum of the detector noise, assumed 
to be white noise. 

We will consider the limit in which Wj^^ is small enough that Nf'^ <C Cf'' but is large enough that A"/^^ S> Cj^^. 

1 /? 
If the noise where even lower then the S/N for detecting the cross correlation will be large S/N~ Nj^' where Nl is 

the total number of cross correlations that can be measures, N^ ~ fskylf- In the limit we are considering, 
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FIG. 7. Ratio of variance of power spectrum measured in the simulation to the predicted result assuming Gaussian fields. 
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(26) 



where we have introduced (■^)p 



Np/2{Ci/wrp )^, the signal to noise with which we can measure the power in the 



band I hi — lh2- We call Np the number of modes that we can use to estimate the power, which is relate to TV^ by, 

'■I 



Np 






/2 



(27) 



Thus the S/N to measure the cross correlation is comparable although always somewhat smaller than that to detect 
the power directly. For example if we take /; = 2000, hi = 4000 and lh2 = 6000 then SjN k. ./2jZCi{S / N) p . 



IV. THE THREE AND FOUR POINT FUNCTIONS IN THE SMALL ANGLE LIMIT 

In this paper and in our previous studies |q,0,EO[ we have investigated several ways of detecting the effect of 
gravitational lensing on the CMB. As we have explained this amounts to trying to detect the distortions on the 
random CMB maps created by the random distributions of the dark matter in the universe. In prt we used the ISW 
effect as a tracer of the dark matter distribution and combinations of the CMB derivatives to measure the effect of 
lensing. The cross correlation of these two effects allowed us to gain information about the time evolution of the 
gravitational potential. Our method combined the information in particular configurations of the three point function 
of the temperature. Other studies have used other combinations of the bispectrum to detect the signal M and also 
calculated the contributions to the bispectrum coming from other secondary procesess |H,p[- 

In Iq] we used the power spectrum of quadratic combination of derivatives of the CMB to measure the power 
spectrum of the projected mass density k. This method was valid in the limit in which we wanted to recover the 
long wavelength modes of k from information in the small scale CMB. This regime is analogous to weak lensing of 
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background galaxies. In essence the different estimates of the power spectrum of k at different scales were obtained 
by combining different configurations of the four point function of the lensed temperature. 

In the present paper we studied other configurations of the four point function to illustrate the nature of the non- 
Gaussianities induced by lensing on small scales. The non-Gaussian nature of the generated power manifested itself 
in the correlations between the large scale gradient and the small scale generated power. 

In order to have a unified picture of the different statistics we have proposed it is convenient study directly the four 
point function of the temperature field and a three point function which correlates two temperatures and another 
field X. The field X stands for any field that cross correlates with k. In our paper jlO| X = T but one can imagine 
doing this correlation with other tracers of the mass, like the fluctuations of the Far Infrared Background |0]. 

We define the connected three and four point functions as, 

{X{h)T{l2)T{l3))c - (27r)2^^(ii23)T3(Zi,Z2,Z3) 

{T{h)T{h)T{h)T{h)), = {2nf5''{h2Zi)Ti{luh,h,h). (28) 

Gravitational lensing produces, 

'^3(^1,^2,^3) = ZQ'i [C12 —p2 '" ^/3 ,2 ] 

ff riff 



Ti{li,l2,l3,h) — Ci^ Cj 






+ permutations (5 terms proportional to C// Cl/ , C// C// , C// C/3' , C// C// , C// C// ). (29) 

The unconnected part of the four point function also gets corrections. To make the calculation of these terms fully 
consistent up to second order in the deflection angle we need to also consider the contributions coming from the 
second order in the expansion of equation y^ . The unconnected terms are not relevant for our study so we will not 
write them down here. 

In our previous papers we introduced three variables £, B and S. We had defined them in terms of derivatives to 
the temperature field. Equivalently we can write. 



(27r) 
Q(0 = / ^[(^. - h^)h^ - {ly - liy)liy]T{l - h)T{h) 

U{1) = J -^[{Ix - hx)hy + [ly - hy)hx\T{l - h)T{h) 

£{l) = Q{l)cos{2(l)i)+U{l)sm{2<j)i) 



B{1) = -Q{l)sin[2cj)i)+U{l)cos{2(j)i) (30) 

When we average over the CMB random field we get, 

{S{l))cMB = ((2^)'5^(0 - 2K{l))as 

{£{l))cMB = -2K{l)as 

{B{l))cMB = 0. (31) 

We have introduced erg — J ldl/27r PCf^ . 

To extract all the information in this three point function se combine all possible configurations with a weight /3 
chosen to maximize the signal to noise ratio. We define, 

^ ^ (^ / ^^'^^^^' ^'' ^^^ X(Zi)T(Z2)T(Z3). (32) 

For these mildly non-Gaussian maps, the variance can be calculated by only taking the Gaussian part of the temper- 
ature, so that 
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Var(y) ^ Af{2nf / ^^P\S) 2 Qf^^f Qf. (33) 



The power spectra in equation ( |33D must include the contribution from detector noise. There are additional terms in 
the variance if the field X and T had some cross correlation before lensing. In practice these terms are unimportant 
if one is interested in measuring the cross correlation C^^ at large angular scales (low I), as was the case in our study 
in pel ]. This is so because most the information of lensing is encoded in the high I modes of the temperature, so it is 
effectively as if the integral over I2 in equation ( p2[ ) is done over high I modes while the li integral only involves low 
I. Thus the terms that would involve Cf^'^ are absent because there are no pair of triangles in which X and T are 
evaluated on the same I. 

The weight P that maximizes the S/N is (3 (x Tz{li,l2,h)/2C^^CfJ'Cj'J' . Finally we get, 

{if = A-^ I dHj '"'' ^^' 



N> f J 'J (2^)2 2C,f ^C,7C;7 



= A7Wd^z,-4(^'")^ 



h r^ft ^2 ■ li , ^tt h ■ h -f2 1 foA\ 



Cff J (2^)2^ '" ll '' ll 2CTT{l^)CTf(^i^ 

The power spectra in the denominator include the contribution from detector noise (amplified by the beam response 
Ci ^ Ci + B^Ni. The easiest way to calculate Cf is to use a Monte Carlo technique. We used the implementation 
of the VEGAS algorithm in Numerical Recipes |1J] . 

The above formula can be compared to the what we obtain using the S and £ variables ||l^. Equation (7) of pX| ] 
read, 

i^r - J d%'-^Wmj^ + ^), (35) 

where W'^{1) is a window that encapsulates the effect of beam smearing. For low I Nf'^ and Nf^ are constant and 
satisfy, Nf^ = 2Nf^. 

In figure H we compare l/Cf to W^{1){^^^ + -j^r)- We show the results for two separate examples, an ideal 
experiment with no noise and infinite resolution and the Planck satellite. We focus on the large scale limit and for 
the ideal experiment we only consider the information coming from modes with I < 3000. There are several salient 
features of the comparison. Although the difference between the methods is not so large for Planck it is much larger 
for the ideal experiment. This can be easily understood. In our previous method the power spectrum of the CMB 
noise in this limit was, 

NfS =. {2n)l^-^i^^xX. (36) 

{JlHlCf^Y 

It is clear from equation ( pq ) that once we get into the damping tail where the Cf" fall exponentially, N^^ no longer 
changes which means that our method does not receive any information from those modes. In contrast equation (p4) 
shows that the amplitude of the Cf^ cancels in Cf as long as the modes had been measured with high S/N . Thus 
the new method continues to extract information from modes in the damping tail. In most practical cases this is 
not very important because the detector noise quickly dominates in this regime and then all methods downweight 
the modes. This explains why the difference between the two methods is not that large for Planck. There is another 
relevant consideration. When computing the variance we assumed that the field was only mildly non-Gaussian and 
that we could take the unlensed temperature power spectrum to calculate it. This is clearly not the case on small 
scales. As we have shown in previous sections, on small scales the fluctuations become very non-Gaussian as most of 
the power is generated by lensing. We only consider modes with I < 3000 for the calculation of Cj to partially take 
into account this effect. 
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FIG. 8. Comparison between Cj and the noise of our old method. We show the results for the Planck satellite and an 
ideal experiment. For our old method there was no difference in the noise between Planck and an ideal experiment on these 
angular scales. 
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(a) 




FIG. 9. Quadrilaterals corresponding to the two limits discussed in the text. Panel a corresponds to configurations relevant 
for the cross correlation between the large scale gradient and the small scale power. Panel b shows the configurations that enter 
in the calculation of S and £. 

A. The four point function 

In the first part of this paper we have studied one particular physical hniit when we are trying to recover information 
on the fluctuations of the mass distribution on scales much smaller than the coherence length of the CMB. To recover 
this limit we have to consider a quadrilateral in which two sides are much smaller that the other two (figure Oa). The 
two small sides correspond to the low pass filtered derivatives while the large I correspond to the high passed filtered 
one. We consider the case where hjh ^ h ^ h- As we noted before the power spectrum of the primary anisotropics 
decreases exponentially while that of the deflection angle is only as a power law. We conclude that of all the terms in 
equation (E9h only those explicitly written dominate, 

Ti{li,l2,h,h) ~ '2, j2 ^h^h Q2 ' ^1,^2 < ^3 ~k, (37) 

'3 

where we approximated Cf^ ~ Cf^ . 

A different set of quadrilaterals dominate in the calculation of £ and S. Those variables extract information 
about the large scale k fluctuations from small angular scale fluctuations in the CMB. If we focus on modes of the 
temperature on scales larger that the damping tail t^Cf remains approximately constant while the power spectra 
of the deflection angle falls. Moreover S and £ are combination of derivatives of the CMB and the extra ^s weigh the 
contribution to smaller scales. The power spectra of S and £ are dominated by the type of quadrilaterals shown in 
figure 0b. All the Is are large but the quadrilaterals are thin. The thin diagonal corresponds to the I of the k mode 
being recovered. The terms in equation (GS) proportional to Cf^ Cf^ Cf^ (where Zi and I2 represent the length of 
the sides and Z12 is the small diagonal) dominate. 

To extract all the information in the four point function we can add all the quadrilaterals with an appropriate 
weight, 

f f dH, d% d%^^^^ T{h)T{h)T{h)T{h), (38) 



(2^)2 y Ai Ai A 
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where I4 = —{h + 12 + h) and Ai is the area in / space we are using. For the optimal filter that minimizes S/N one 
gets (3 ex Til Cfi^ €'[2'' Cf^ Cf^ , where we have assumed Gaussianity to compute the variance. For the S/N we get, 

(^)2 ^ Jj_ [d%d%d%^MllkM^, (39) 

N 24{2tt)^ J -^ ^TT (jTT (jTT (jTT ' ^ ' 

where the power spectra in the denominator should include the contribution form detector noise. 
We change integration variables in equation (|39|) and write, 



^f^> ^ } 'cff^ 



^q//^ -24 7 (2^)2 (2^)2 c'Tt^^C,t%|^C'/;^^g(4C'--)2' 

This is a useful change of variables because it makes the integral resemble what we had in our old method. In this 
way the limit ^1 — > corresponds to quadrilaterals that have four large sides but a small diagonal (li). Equation (6) 
of 1^ reads. 



'N 



'-AJ' I d^Z^^r^, (41) 

Ci 



with tjp^ ~ ^c^^ "*" ^c^^^ "*" '^c^^ ■ 

In figure 10 we compare the results C; in (40) with aci (^it). The plot is qualitatively similar to figure H for 
the three point function. While there are hardly any improvements in our previous method when wc go from Planck 
to an ideal experiment, there are significant differences for the optimal filter which continues to gather information 
from the damping tail. For Planck the situation is different, both the optimal method and our old method obtain a 
similar amount of information from the data. Even though the optimal method is able to get information from the 
damping tail in the ideal case, this is unimportant for Planck because the finite size of the beam makes it impossible. 
The fact that our previous method seems to have slightly less noise than the optimal method when I is a few hundred 
is most probably an artifact. The noise in our old method was calculated using a Gaussian approximation which se 
had seen braking down slightly in our simulations . As for the three point function we only included modes of the 
temperature with I < 3000 as the power generated by lensing is non-Gaussian so our estimate of the variance of Z is 
not valid on smaller scales. 

In the limit li —> the four point function becomes approximately, 

^4(^1 ~ ^2,^2,^37— ^1 — ^3) ~ —^2 Q3 'iQi • (42) 

To obtain equation (H3) we had to assume that Cj^'^ is a decreasing function of I and that CT^ , « C^'^ and the 
equivalent formula for l^. Both of these assumptions break down in some range of Is. For example, Cj^'^ has a peak 
at / '^ 100 and when I2 is in the damping tale range, for finite li there might be corrections due to the difference 
between CT^ , and C^^ . We can still use this expression as a rough estimate to try to compare how the optimal 
/3 compares with the weight used by our previous method. In this limit and for the quadrilaterals relevant for this 
statistic, the optimal /3 is equivalent to multiplying each of the temperatures by {Cf^Y^"^ /{Cf^ + BfNf^). Thus for 
a temperature power spectra that goes as Cf" ex l~^ and when detector noise is irrelevant, the optimal filter amounts 
to multiplying the temperatures by I, equivalent to taking derivatives. This is the reason our previous method is 
not far from optimal in situations where we can neglect detector noise and we are not trying to extract information 
form the damping tail of the CMB. As we mention when we discussed the 3pt function, on small enough scales our 
treatment of the noise breaks down because the power is dominated by the power generated by lensing. The Gaussian 
approximation for the noise will not be valid. 

V. CONCLUSIONS 

We have studied the generation of power by gravitational lensing on small angular scales. We have shown that the 
power spectra of the anisotropics gives a measure of the spectrum of the deflection angle. The generated power is 
correlated with the size of the large scale gradient. 
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FIG. 10. Result of Monte Carlo for 4pt function together with result from previous technique. 
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The generation of power can be understood by studying the lensing of the primary anisotropics by a cluster of 
galaxies. On the scales of a cluster the CMB can be assumed to be a simple gradient. Lensing generates a wiggle on 
top of the gradient that can be tens of ^iK. This signal will be large enough to be detected by a CMB experiment 
which targeted clusters with sufficient angular resolution, ~ 1 arcmin. 

The lensing effect produced by the large scale structure of the universe can be separated from other secondary 
effects or from intrinsic CMB anisotropies at the last scattering surface by measuring the cross correlation between 
the map of the large scale gradient and the map of the small scale power. We have shown that this statistic has only a 
slightly smaller signal to noise than the measurement of the small scale power itself. The power generated by lensing 
dominates over the intrinsic fluctuations for I )k 4000. 

We have calculated the three and four point function of the lensing field in the small angle limit. The cross 
correlation between large and small scales as well as the statistics introduced in B are particular combinations of the 
four point function of the temperature field. We have calculated explicitly the dependence of the three and four point 
functions on the CMB and deflection angle power spectra as well as on the shape of the configuration. It is fair to 
say that both the statistic introduced here and those used in ||6| can be viewed as particular ways of compressing the 
information in the four point function that take into account the physical intuition coming from our understanding 
of the lensing effect. The lensing effect predicts a particular dependence of the four point function on configuration 
and scale that can be used to separate it from other non Gaussian signals. 
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